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ABSTRACT 



The integrity of descending white matter pathways, measured by fractional anisotropy from DW-MRI, is a key 
prognostic indicator of motor recovery after stroke. Barriers to translation of fractional anisotropy measures 
into routine clinical practice include the time required for manually delineating volumes of interest (VOIs), 
and inter-examiner variability in this process. This study investigated whether registering and then editing tem- 
plate volumes of interest 'as required' would improve inter-examiner reliability compared with manual delinea- 
tion, without compromising validity. MRI was performed with 30 sub-acute stroke patients with motor deficits 
(mean NIHSS = 11, range 0-17). Four independent examiners manually delineated VOIs for the posterior 
limbs of the internal capsules on Tl images, or edited template VOIs that had been registered to the Tl images 
if they encroached on ventricles or basal ganglia. Fractional anisotropy within each VOI and interhemispheric 
asymmetry were then calculated. We found that 13/30 registered template VOIs required editing. Edited tem- 
plate VOIs were more spatially similar between examiners than the manually delineated VOIs (p = 0.005). 
Both methods produced similar asymmetry values that correlated with clinical scores with near perfect levels 
of agreement between examiners. Contralesional fractional anisotropy correlated with age when edited template 
VOIs were used but not when VOIs were manually delineated. Editing template VOIs as required is reliable, in- 
creases the validity of fractional anisotropy measurements in the posterior limb of the internal capsule, and is 
less time-consuming compared to manual delineation. This approach could support the use of FA asymmetry 
measures in routine clinical practice. 

© 2013 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license 

(http://creativecommons.Org/licenses/by-nc-nd/3.0/). 



1. Introduction 

Diffusion tensor imaging provides information about tissue compo- 
sition that reflects the microstructural integrity of white matter tracts 
in the brain (Basser, 1995; Basser and Pierpaoli, 1996). Fractional an- 
isotropy (FA) quantifies the extent to which water diffusion is 
directionally restricted and is the most common DTI parameter used 
to assess white matter integrity (Jang, 2010). Disruption of white matter 
tracts results in less restriction on water diffusion and a lowering of the 
FA value (Basser and Pierpaoli, 1996; Werring et al., 2000). After stroke, 
reduced white matter integrity can occur acutely within the primary 
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lesion location, and can be a delayed and distant consequence of anter- 
ograde and/or retrograde axonal degeneration (Thomalla et al., 2004; 
Werring et al., 2000). FA derived from DTI correlates with upper limb 
function in chronic stroke patients (Stinear et al., 2007), and can be 
used to predict recovery of upper limb motor function at both early 
(Jang et al., 2005, 2008; Maeda et al., 2005; Stinear et al., 2012) and 
chronic stages of stroke (Stinear et al., 2007). 

Two approaches to DTI-based analysis can be used to evaluate FA 
values in white matter tracts. The first is white matter tractography, in 
which three-dimensional reconstructions of tract trajectories are calcu- 
lated from the DTI vector field (Mori and van Zijl, 2002). Regions of in- 
terest are defined, typically on a single axial slice, as seeds, waypoints 
and endpoints for subsequent tractography. Mean FA values along the 
tracts can then be compared between contralesional and ipsilesional 
hemispheres (Borich et al„ 2012; Puig et al., 2012; Wakana et al., 
2007). This approach can be confounded by anatomical variations or 
stroke lesions, and requires at least a part of the tract of interest to be in- 
tact from start to finish (Borich et al., 2012; Puig et al., 2011; Tang etal., 
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2010; Wakana et al., 2007). Moreover, tractography is a complex and 
time-consuming process, and results vary depending upon the particu- 
lar algorithm employed (Chung et al., 201 1 ; Ciccarelli et al., 2008). 

A second approach involves defining a three-dimensional VOI and 
then calculating the mean FA within the volume. A typical choice of 
VOI is the posterior limb of the internal capsule (PLIC). Mean FA is calcu- 
lated bilaterally within the PLIC VOls to determine FA asymmetry 
(Borich et al., 2012; Stinear et al., 2007). FA asymmetry correlates 
with current upper limb motor function (Lindenberg et al., 2010; Zhu 
et al., 2010), and can be used as a predictor of motor recovery in both 
chronic (Stinear et al., 2007) and acute patients (Jang et al., 2005; 
Stinear et al., 2012). 

The most commonly used method to define VOIs is manual tracing, 
in which experienced examiners delineate anatomical structures 
(Karnath and Perenin, 2005; Moro et al., 2008; Mort et al., 2003). Man- 
ual drawing methods remain the gold standard for the exact delineation 
of anatomical structures as they require fewer computing resources 
than tractography-based approaches and are more intuitive for clinical 
use. However, shortcomings of manual tracing include: being limited 
to regions identifiable by anatomic landmarks (Eckert et al., 2008); 
being labor intensive (Ashton et al., 2003; Seghier et al., 2008); and er- 
roneous inclusion of structures such as gray matter and other tracts 
(Holodny et al., 2005; Park et al., 2008). For these reasons, an automated 
method of VOI delineation may be preferable. Automated methods are 
much faster, and they may also minimize inter-examiner disagreements 
(Wilke et al., 201 1 ). However, automated methods may have shortcom- 
ings for studies of stroke patients, such as inadequate compensation for 
the structural distortions introduced by lesions that may result in gray 
matter or CSF being included in the VOI. Additionally, there may be in- 
adequate correction for anatomical variability between subjects (Fiez 
et al., 2000). A compromise between manual and automated ap- 
proaches exists in which VOls produced by an automated technique 
are manually edited as required to correct any inappropriate inclusion 
of gray matter or CSF. It is not clear whether manual or edited VOI delin- 
eation methods produce measures that are the more reliable and 
accurate. 

The aims of this study were to compare the reliability and validity of 
manual and edited VOI methods for defining the PLIC on MR images 
from sub-acute stroke patients. We hypothesized that editing registered 
templates would produce valid and more reliable FA values, compared 
to the more time-consuming process of manual delineation. 

2. Material and methods 

2.1. Participants 

Participants were recruited if they were at least 18 years old and 
had experienced a first-ever ischemic stroke resulting in persistent uni- 
lateral upper limb impairment. Exclusion criteria were any neurological 
or other conditions that would prevent informed consent or hinder the 
acquisition or interpretation of the data, such as cognitive or communi- 
cation deficits, previous stroke, and contra-indications to MRI. Partici- 
pants were screened using a MRI safety checklist. The study was 



approved by the regional ethics committee, and all participants provid- 
ed written informed consent, in accordance with the Declaration of 
Helsinki. 

There were 30 participants in this study (20 females; mean age 68 
years, range 31 to 92 years; 19 with right hemisphere lesions and 11 
with left hemisphere lesions; Table 1, Fig. 1) and all except one was 
right-hand dominant before their stroke. All participants completed 
MR imaging within a mean of 11 (range 4 to 22) days of stroke. At a 
mean of 13 (range 5 to 23) days after stroke, a clinical assessor evaluat- 
ed stroke severity using the National Institutes of Health Stroke Scale 
(NIHSS), motor impairment of the affected upper limb using the Fugl- 
Meyer (FM) scale, maximum score 66 (Fugl-Meyer et al., 1975); and 
upper limb function using the Action Research Arm Test (ARAT), maxi- 
mum score of 57 (Lyle, 1981 ). The clinical assessor was blinded to the 
MR images. 

2.2. MR data acquisition 

Scanning was performed using a Siemens Magnetom Avanto 1.5 T 
MRI system. To provide anatomical reference, Tl -weighted images were 
obtained with a 3D MPRAGE sequence (TR = 11 ms, TE = 4.94 ms, 
field-of-view = 256 mm and voxel dimensions of 1.0 x 1.0 x 1.0 mm) 
aligned to an axial plane parallel to the anterior and posterior commis- 
sures (the AC-PC line). 

Diffusion tensor imaging was conducted with a single shot spin echo 
EPI pulse sequence (factor = 128, TR = 6700 ms, TE = 101 ms, field-of- 
view = 230 mm and voxel dimensions of 1.8 x 1.8 x 3.0 mm) with 30 
uniformly distributed (Stejskal and Tanner, 1965) motion-probing gra- 
dient orientations (p = 2000 s/mm 2 ). Head movement was constrained 
with expandable foam cushions. MR images were visually inspected for 
motion artifact or instrumental noise. Scanning was repeated if major 
artifacts were present. Overall time in the scanner was approximately 
20 min per participant. 

2.3. Manual PLIC delineation 

Each examiner pre-processed the images and manually delineat- 
ed the PLIC VOIs using FSL (FMRIB Software Library, Oxford) (Smith 
et al., 2004; Woolrich et al., 2009). Four independent examiners 
(expert: V.K.; examiner 1: M.P.; examiner 2: C.Z.; examiner 3: E.V.) 
performed cross-sectional VOI delineation of each PLIC. V.K. is a neu- 
rologist with clinical experience in interpreting MR images, and 
hence was deemed to be an 'expert' examiner for the purposes of 
this study. The other 3 'novice' examiners were medical researchers 
who had previous experience in identifying the pertinent structures. 
None of the examiners had prior experience in VOI drawing and so 
were trained in the use of the software packages and the required 
workflow. 

Examiners used the Tl -weighted images to delineate the PLICs and 
no other images were consulted while drawing in order to prevent 
any bias that FA maps could introduce when determining the PLIC bor- 
ders. Training for the VOI drawing task was accomplished using previ- 
ously delineated examples in reference atlases of healthy brains. 




Fig. 1. Lesion overlap map. Images have been flipped so all lesions appear in the right hemisphere. 
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Table 1 

Participant demographics. 
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F = female; M = male; y = years; DH = dominant hand; R = right; L = left; SH = stroke-affected hemisphere; Lesion location: ALIC = anterior limb of the internal capsule; BS = 
brainstem; CN = caudate nucleus; CR = corona radiata; EC = external capsule; FT = frontal lobe; GenIC = genu of the internal capsule; Ml = primary motor cortex; PUC = posterior 
limb of the internal capsule; PMC = premotor cortex; PT = parietal lobe; Put = putamen; Thai = thalamus; TP = temporal lobe; TSS = time since stroke when MR1 was performed; 
d = days; N1HSS = National Institutes of Health Stroke Scale; ARAT = Action Research Ai m Test; FM = Fugl-Meyer Upper Limb Scale; Edit = registered templates were edited as they 
encroached on the ventricles and/or thalamus. 



Examiners were blinded to the other examiners' evaluations and to 
other marked-up scans of the same patient that they may have previ- 
ously processed. 

Delineation of the PLICs was performed in both the ipsilesional and 
contralesional hemisphere, in a voxel-wise fashion using the drawing 
tools in FSLView. Examiners worked on the axial plane and began at 
the inferior slice that best corresponded to the AC-PC line. In the case 
of complete destruction of the ipsilesional PLIC, the region was estimat- 
ed as a mirror volume to the contralesional PLIC (Stinear et al., 2007). 
The PLIC typically occupied 20 or more image slices but examiners man- 
ually delineated the PLIC on the 10 most inferior slices to allow images 
to be processed in a timely manner. 



2.4. Registration of template PUC VOIs 

Each examiner first skull stripped the structural Tl -weighted images 
using the Brain Extraction Tool (BET) (Smith, 2002). This involved itera- 
tively specifying a skull strip threshold value, visually confirming that 
the BET process had been completed satisfactorily, and repeating if neces- 
sary. DWIs were also skull stripped using BET, using the same iterative 
procedure. 

The Johns Hopkins University (JHU) DTI -based atlas of white matter 
tracts (Hua et al., 2008; Mori et al., 2008; Wakana et al., 2007) was used 
to create the PLIC templates. FMRIB's Linear Image Registration Tool 
(Smith et al., 2004) was then used to perform a linear transform of the 




Fig. 2. Examples of PLIC VOIs produced by the expert examiner. A) Registered template VOIs. B) Edited template VOIs. C) Manually delineated VOIs. 
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Montreal Neurological Institute (MNI152) template to native patient- 
space and apply that same transform to the PL1C templates. Linear reg- 
istration was used, instead of non-linear registration; the latter has less 
clinical utility as it requires lesion masking and takes more time. 

Registered template VOls that encroached on the basal ganglia or 
ventricles were edited by each of the four examiners. Voxel-wise editing 
of the template PLIC (overlaid onto the axial view of each Tl -weighted 
image) was performed using the drawing tools in FSLView (Smith 
et al., 2004; Woolrich et al., 2009). See Fig. 2 for example of PLIC VOIs 
produced by the expert examiner. 

2.5. Spatial similarity analysis 

The Dice Similarity Index ( DS1 ) ( Dice, 1 945 ) was used to quantify the 
spatial similarity between the VOIs produced by the expert examiner 
and each of the non-expert examiners. DSI ranges from 0 to 1 and is cal- 
culated to be twice the number of overlapping voxels between two 
VOIs, A and B, divided by the total number of voxels contained in both 
VOIs; DSLvb = 2 x (A n B) / (A U B), where n is the intersection and U 
is the union. If two VOIs contain the same number of voxels, and 50% 
of their voxels overlap, this results in a DSI of 0.5. 

2.6. Fractional anisotropy 

Diffusion-weighted images were corrected for motion and eddy cur- 
rents before using FMRIB's Diffusion Toolbox to compute diffusion ten- 
sors and FA maps. The FA maps were then registered to each 
participant's Tl -weighted image, and masked using the PLIC VOIs pro- 
duced by the four independent examiners. Mean FA values within the 
contralesional and ipsilesional VOIs were used to calculate FA asymme- 
try for each dataset. FA asymmetry values were calculated as: 
FA asymmetry = (FA C - FA,) / (FAc + FA,), where FA C = mean FA in 
the PLIC of the contralesional hemisphere and FA, = mean FA in the 
PLIC of the ipsilesional hemisphere, yielding a value between —1.0 
and +1.0 for each participant. Zero indicates symmetrical mean FA in 
the PLICs and positive values indicate relatively reduced mean FA in 
the ipsilesional PLIC (Stinear et al., 2007). No provisions were made 
for handedness or sex, as previous research indicates that these factors 
do not influence FA values within the PLIC (Buchel et al., 2004; Takao 
et al., 201 1 ; Westerhausen et al., 2007). 

2.7. Statistical analysis 

The sample size was calculated by determining the minimum num- 
ber of participants required to detect an intra-class correlation coef- 
ficient of at least 0.8 for FA measures. With 4 independent examiners, 
a = 0.05 and [i = 0.20, a minimum sample of 28 participants was re- 
quired (Walter et al., 1998). 

Spatial similarity between the PLIC VOIs produced by the expert and 
non-expert examiners was analyzed with a RM-ANOVA of the DSIs, 
with factors: method (manual, edited), examiner (1, 2, 3) and hemisphere 
(contralesional, ipsilesional). Mean FA values obtained from manually 
delineated and edited template VOIs were analyzed with a RM- 
ANOVA with factors: method (manual, edited), examiner (expert, 1, 2, 
3) and hemisphere. FA asymmetry was analyzed with a RM-ANOVA 
with the factor examiner. 

To investigate the inter-examiner reliability of the manually delin- 
eated and edited template methods, separate two-way random effects, 
absolute agreement intraclass correlation coefficients (ICC) for single 
measures were calculated using the mean FA values from the ipsi- and 
contralesional hemispheres and the FA asymmetry values. According 
to accepted criteria, ICC values of 0.1 1 -0.2 are considered "slight" agree- 
ment, 0.21-0.4 are "fair", 0.41-0.60 are "moderate", 0.61-0.80 are "sub- 
stantial", and 0.81-1.0 are "almost perfect" agreement (Landis and 
Koch, 1977). 



The ecological validity of the manual and edited VOI delineation 
methods was evaluated by calculating a simple linear regression be- 
tween the contralesional FA, measured by each examiner, with age to 
see if the correlations were negative, as expected. Further validation 
was performed by calculating separate linear regressions between FA 
asymmetry and ARAT score, FM score, and NIHSS score. We expected 
each of these regressions to be significant, as correlations between 
motor deficit and FA asymmetry have been reported previously 
(Lindenberg et al., 2010; Puig et al., 2011 ; Stinear et al., 2007). 

Statistical analyses were conducted using SPSS software (20.0 
SPSS Inc., Chicago, USA). Statistical results were deemed significant if 
p < 0.05. Greenhouse-Geisser corrections were undertaken when sphe- 
ricity was violated. 



3. Results 

The registered template VOIs were found to encroach on basal gan- 
glia structures, ventricles or both in 13 of the 30 images. The examiners 
independently edited these templates, prior to calculation of mean FA 
values, FA asymmetry, and Dice Similarity Indices. The edited template 
dataset therefore contained data obtained using 1 3 edited and 1 7 uned- 
ited registered templates. The edited template dataset was compared 
with the manually delineated dataset. 

There was a main effect of method on the spatial similarity of the PLIC 
VOIs (F! 29 = 9.41, p = 0.005). When the examiners edited templates as 
required, the non-expert examiners' VOIs were more spatially similar to 
the expert examiner's VOIs (edited mean DSI = 0.725, 95% CI 0.639- 
0.810; manual mean DSI = 0.588, 95% CI 0.559-0.617). There were no 
other main effects or interactions (all p > 0.05). The mean number of 
transverse slices included in the PLIC VOIs was 10 (range 9-15) for the 
manually delineated VOIs, and 20 (range 13-28) for the edited template 
VOIs. 

For FA measured in the PLIC VOI of each hemisphere, there were 
main effects of method (F 12 9 = 4.40, p = 0.045), hemisphere (F 12 g = 
18.65, p < 0.001 ), and examiner (F 3 87 = 17.99, p < 0.001 ), and no inter- 
actions (all p > 0.10). Overall, mean PLIC FA values were lower when 
templates were edited as required (edited mean FA = 0.461, 95% CI 
0.443-0.479; manual mean FA = 0.472, 95% CI 0.456-0.489). As ex- 
pected, mean FA values were higher in the contralesional PLIC VOI 
(contralesional mean FA = 0.498, 95% CI 0.484-0.512; ipsilesional 
mean FA = 0.436, 95% CI 0.408-0.464). The main effect of examiner 
arose because examiner 3 produced PLIC VOIs with both methods that 
resulted in higher FA values across both hemispheres (examiner 3 
mean FA = 0.485, other examiners' mean FA = 0.461 , 95% CI of the dif- 
ference in means 0.017-0.031). The mean FA asymmetry was 0.07 
(range — 0.04-0.43), with no effects of method or examiner (all p > 0.3). 

Intraclass correlation coefficients indicated substantial agreement 
between examiners for the ipsilesional FA values produced by both 
the edited and manual methods (Table 2). Agreement was near perfect 
between examiners for the contralesional FA and FA asymmetry values 
produced by both the edited and manual methods. 

There were negative correlations between contralesional mean FA 
and age when examiners edited template VOIs as required, but not 
when they manually delineated the VOIs (Table 3 ). There were negative 
correlations between FA asymmetry and ARAT and FM scores, when ex- 
aminers used both methods of VOI production. There were positive cor- 
relations between FA asymmetry and NIHSS score for 3 of the examiners 



Table 2 

Intraclass correlation coefficients, with 95% confidence intervals. 



Method 


Measure 








Contralesional FA 


Ipsilesional FA 


FA asymmetiy 


Manual 


0.622 (0.414-0.784) 


0.870 (0.782-0.930) 


0.872 (0.792-0.930) 


Edited 


0.617 (0.431-0.773) 


0.864 (0.778-0.925) 


0.896 (0.828-0.943) 
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Table 3 

R values for linear regressions between FA parameters. 



Expert examiner 



Examiner 1 



age and clinical scores. 
Examiner 2 Examiner 3 



Contralesional FA and age 

Manual -0.267 -0.360 -0.288 -0.192 

p = 0.153 p = 0.050 p = 0.123 p = 0.309 

Edited -0.529 - 0.458 - 0.512 - 0.408 

p = 0.003 p = 0.011 p = 0.004 p = 0.025 

E4 asymmetry and ARAT score 

Manual -0.430 -0.369 -0.422 -0.473 

p = 0.018 p = 0.045 p = 0.020 p = 0.008 

Edited -0.454 -0.458 -0.459 -0.439 

p = 0.012 p = 0.011 p = 0.011 p = 0.015 

E4 asymmetry and FM score 

Manual -0.429 -0.375 -0.426 -0.458 

p = 0.018 p = 0.041 p = 0.019 p = 0.011 

Edited -0.412 -0.433 -0.437 -0.436 

p = 0.024 p = 0.017 p = 0.016 p = 0.016 

E4 asymmetry and NIHSS score 

Manual 0369 0.347 0.403 0.420 

p = 0.045 p = 0.060 p = 0.027 p = 0.021 

Edited 0.464 0.543 0.442 0.453 

p = 0.010 p = 0.002 p = 0.015 p = 0.012 



when they manually delineated VOIs, and for all of the examiners when 
they edited template VOIs as required. 



4. Discussion 

The main finding from this study is that multiple examiners can edit 
registered template PL1C VOIs as required and achieve greater spatial 
similarity and ecological validity than when they use a more time- 
consuming manual delineation method. Editing template PL1C VOIs 
seems viable in the clinical setting, and may allow the use of FA asym- 
metry measures for predicting motor recovery to become part of rou- 
tine stroke rehabilitation practice. 

The procedural skull-stripping required some decision-making by 
each examiner, as they iteratively determined the optimal threshold 
for the brain extraction tool. The subsequent registration of template 
PLIC VOIs to the participant's Tl image was relatively quick, but did 
not always account for anatomical variations that exist in older or 
stroke-affected brains (Fiez et al., 2000). The template PLICs were de- 
rived from a MRI database of 152 young adult brains (Mazziotta et al., 
1995), so it is unsurprising that the template may not ideally fit older 
brains. Editing the registered PLIC VOIs was required for the scans of 
13 out of 30 participants, to remove voxels that encroached on ventri- 
cles or basal ganglia gray matter. 

The spatial similarity of the VOIs produced by the expert and non- 
expert examiners was higher when template VOIs were edited as re- 
quired, compared to fully manual VOI delineation. The mean overlap 
of non-experts' VOIs with the expert's was 59% for manually delineated 
VOIs, and 73% for edited template VOIs. VOI delineation methods are 
highly sensitive to examiner effects, as the boundary of the fiber tract 
is often difficult to define and varies between examiners (Borich et al., 
2012; Hong et al., 2008; Tang et al., 2010; Wang and Melhem, 2005; 
Yu et al., 2009). The decision on where to define VOI boundaries can re- 
sult in substantial differences in mean FA values, even when the VOIs 
are centered on the same location (Basser and Pajevic, 2000). The pres- 
ent data indicate that editing registered template VOIs is less sensitive 
to examiner effects than manual drawing methods, as examiners effec- 
tively 'start in the same place'. Written protocols, video-based tutorials, 
or DTI-specific training, have also been suggested as ways to reduce 
inter-examiner variability in MR analyses (Bonekamp et al., 2007; 
Brown et al., 2011). 



We also found that manual delineation of PLIC VOIs produced higher 
overall mean FA values than edited template VOIs, possibly because the 
examiners more precisely excluded subcortical gray matter when draw- 
ing the VOIs by hand. However, it should also be noted that examiners 
only manually delineated PLIC VOIs for around 10 image slices, 
whereas the template VOIs extended more superiorly and included ap- 
proximately 20 image slices. The superior portion of the PLIC, where it 
transitions to the corona radiata, may have lower FA values due to the 
'fanning out' of axons and crossing fibers. This could contribute to the 
lower mean FA values obtained from both hemispheres when exam- 
iners edited template VOIs. One of the non-expert examiners produced 
consistently higher mean FA values than the other examiners. This was 
likely due to having a conservative bias and restricting the PLIC VOIs to a 
smaller volume, across both hemispheres and with both methods. 

There were no effects of method or examiner on FA asymmetry, and 
both methods of VOI delineation resulted in 'almost perfect' agreement 
between examiners for FA asymmetry values, despite a lesser agree- 
ment between examiners for contralesional FA values (Table 2). This in- 
dicates that FA asymmetry is a more robust measure than mean FA 
values for clinical application in stroke. This is because FA asymmetry 
detects lateralized white matter damage while compensating for exam- 
iner effects and differing methodologies, and normalizes between- 
subject differences in overall white matter integrity. Furthermore, the 
FA asymmetry measures produced by both methods were ecologically 
valid, as they correlated as expected with clinical scores (Table 2) 
(Jang et al., 2005; Stinear et al., 2012; Thomalla et al., 2004; Watanabe 
etal., 2001; Yuet al., 2009). 

Contralesional mean FA did not correlate with age when PLIC VOIs 
were manually delineated (Table 3). This may have been due to 
under-sampling of the FA data, as the manual VOIs were limited to 10 
transverse slices by the time required to draw them. A potential advan- 
tage of editing template PLIC VOIs was that the templates had a greater 
volume, and more of the FA data were included in the subsequent anal- 
yses. When PLIC VOI templates were edited as required, contralesional 
mean FA was negatively correlated with age, as expected. 

To date, a direct comparison between edited and manual methods of 
VOI-based analysis in the PLIC has not been conducted. Previous inter- 
examiner reliability comparisons have mainly used the tractography 
approach, and been performed either with healthy subjects or patients 
with other neurological conditions such as head injury or amyotrophic 
lateral sclerosis (Borich et al., 2012; Hong et al., 2008; Ozturk et al., 
2008; Qiu et al., 2011; Tang et al., 2010; Wakana et al., 2007; Zhang 
et al., 2008). One study that did investigate inter-examiner reliability 
using the manual method in chronic well-recovered stroke patients re- 
ported inter-examiner reliability values of ICC(3,1) = 0.37 for the 
contralesional side and ICC(3,1) = 0.72 for the ipsilesional side 
(Borich et al., 2012). Our study sample size is relatively large and 
employed more examiners compared to other related reliability studies 
in stroke patients (Borich et al., 2012; Qiu et al., 201 1 ; Tang et al., 2010). 
The participant group was more heterogeneous for age, functional im- 
pairment and lesion location. None of the participants had a history of 
previous stroke and the time delay between stroke onset and MRI ac- 
quisition was relatively short. Participants also had greater upper limb 
impairment overall, indicating worse stroke severity compared to sub- 
jects in previous studies (Borich et al., 2012; Qiu et al., 2011; Tang 
et al., 2010). 

The heterogeneous nature of our study sample and the enrolment of 
4 examiners pose a more realistic challenge to the reliability and validity 
of each method in a clinical setting. We found that editing registered 
template PLIC VOIs as required allowed more of the PLIC to be included 
in analyses and improved the spatial similarity of VOIs between exam- 
iners, while avoiding examiner effects on subsequent FA measures and 
improving their ecological validity. We propose that editing registered 
template PLIC VOIs as required is feasible in a busy clinical environment, 
and may allow the use of FA parameters for prognoses at the sub-acute 
stage for post-stroke recovery of motor function. 
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